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ABSTRACT 



A method and apparatus for digitally encoding video image 
data, which is particularly suited for encoding Internet Web 
pages, eliminates the need for performing time consuming, 
computationally intensive motion vector searches by taking 
advantage of prior knowledge regarding the Web page 
movement. In a first preferred embodiment, a digital video 
encoder, such as an MPEG encoder, employs Web page 
scrolling coordinates obtained from a browser application to 
determine, through calculation instead of searching, motion 
estimation for all the macroblocks of the present image 
frame relative to the previous frame. In a second preferred 
embodiment, the encoder receives information from the 
browser application that indicates that certain portions of an 
image, such as a Web page animation window, are continu- 
ously changing, and thus should be encoded as an intra 
frame. The invention may be employed in a cable television 
system which includes Internet access capabilities that per- 
mit system users to download Web pages for viewing on 
their televisions from a remote browser application located 
in the cable system's headend. 

18 Claims, 4 Drawing Sheets 
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MPEG ENCODING TECHNIQUE FOR MPEG encoding is a video image compression technique 

ENCODING WEB PAGES tnal substantially reduces the amount of motion picture 

image data that must be transmitted. This data reduction is 

CROSS REFERENCE TO RELATED made possible because spatial redundancy exists within an 

APPLICATIONS 5 image frame (intra frame compression). In addition, each 

,„ _ ct ,^ . ^ succeeding frame in a motion picture video usually contains 

v ^ "PP h f '' on c 'f J Contirmationof U.S. application Ser. substantia f lemporal .edundaii* i.e., portions which have 

No. 09/093,879, filed Jun. 9, 1998, now U.S. Pat. No. eUber not ffom lhe previous k ame> or have only 

6,266,369, issued Jul. 24, 2001. been moved re i ative t0 (he p rev j 0 us frame (inter frame 

BACKGROUND OF THE INVENTION 10 fW™**)- When spatial redundancy is removed from . 

frame, the frame is said to be encoded as an intra-coded 
The present invention relates to a method and apparatus frame (I-frame). In an inter frame compression scheme, two 
for digitally encoding video image data, and is particularly different compression algorithms may be employed to gen- 
suited for encoding Internet Web pages for transmission and erate two kinds of encoded frames. A compressed image 
display. frame is called a Predictive-coded frame (P-frame) if only a 
With' the ever-increasing popularity of the Internet, a 15 prior frame is compared and the mfference is coded. Another 

number of systems and devices have appeared in the mar- in,e 1 r . ^ ame /° mpress }° n f KSU }% > n * Bidirectionally 

, t , t , J t , t „ , *u • i • * predictive-coded frame (B-frame) if both a prior frame and 

ketplace that substantially reduce the initial equipment 1 . - , r v 7 T r 

r -jf .Lt. .i- i a post frame are used for encoding. In these cases, it is not 

expense required for accessing the Internet For example, n £ tQ transmit aU of thc ^ data for each frame 

inexpensive dedicated processors are available which enable 2Q Instead> ^ me ^ nnct dala representing the portions in 

a user to access the Internet using a telephone line, and the CUfrem frame lhat have changed from the neighboring 

download Internet Web pages for display on the user's (previous or later) frame(s) is transmitted. For areas in an 

television set, image which have been moved relative to the previous 

Recently, an even more attractive Internet access system frame, it is possible to search for these areas, and then 

has been proposed which completely eliminates the need for 25 generate a motion vector which instructs a receiving decoder 

a user to have a telephone line and a dedicated processor to construct a portion of the next image frame by moving a 

running a browser application locally at their premises. This corresponding portion in the previous image frame a speci- 

system employs a modified cable television (CATV) system fied displacement and direction. 

that uses the downstream cable channels to transmit To encode a sequence of video frames, the first frame is 

Internet^based information to the system users via for dis- 30 encoded as an intra or I frame where information for all of 

play on their television sets. Each user is provided with a set the pixels in the frame needs to be transmitted since no 

top converter box that has been modified to enable entry of previous frame information is available. The next frame in 

data or commands via a keyboard, remote controller or other the sequence can then be encoded either as an P (predictive) 

input device. One or more upstream channels are provided frame or a B (bi-directional predictive -coded) frame which 

which transmit the entered data or commands to a headend 35 includes only the difference or motion vector data resulting 

server in the CATV system. The headend server is interfaced from the frame comparisons. P or B frames can continue to 

to the Internet via an Internet Service Provider (ISP), for be used for encoding the succeeding frames in the sequence 

example, and includes processing equipment which can until a substantial change, such as a scene change, occurs, 

simultaneously operate a plurality of resident Internet thus necessitating formation of another I frame. In practice, 

browser applications, one for each system user requesting 40 however, the encoder is programmed to encode I frames at 

Internet access. The headend server therefore contains all of a constant rate, such as for every other N frames. The MPEG 

the processing equipment necessary to access the Internet encoding procedure thus compresses images by suppressing 

through the ISP, while each user's set top box acts as an statistical and subjective redundancy inter and intra frames, 

input/output device for interfacing the user to the Internet. An MPEG decoder is capable of decompressing the coded 

In the operation of the system, a user requests Internet 45 image close to its original format so that the decompressed 

access by entering an appropriate command into the set top image may be displayed on a display device, such as a 

box that transmits the command through an upstream chan- television or computer monitor. 

nel to the headend server. In response, the headend server In the Internet Web page display application, only P 

connects the user to one of the resident browser applications frames are usually employed for inter frame compression 

via one of the system's downstream channels. 50 because B frame coding requires comparison with post (later 

The Internet-based information, e.g., Web pages, can be in time ) frames which are not available immediately, 

transmitted through the downstream channel in a number of However, a B frame can be encoded by forward comparison 

ways. In an analog implementation, for example, the Inter- only between the current frame and the prior frame as a 

net data can be inserted into the vertical or horizontal special case, and in this instance, can also be employed for 

blanking intervals of the conventional analog television 55 w eb page inter frame compression, 

signals which are simultaneously transmitted on the selected In the application of MPEG encoding to the previously 

downstream channel. In an all-digital embodiment, described CATV system, each user's set top box includes an 

however, the Internet data must be encoded in the same MPEG decoder for decoding the digital video bit stream 

format that is employed for digitally encoding video signals. received on the downstream channels. This requires that any 

More particularly, the data must be encoded using standard- 60 Internet Web page image data to be transmitted to the set top 

ized procedures for encoding, storing, transporting and boxes also be MPEG encoded. An MPEG encoder is thus 

displaying continuous video frames that have been specified incorporated in the cable headend to encode the browser 

by The Motion Picture Experts Group (MPEG). Thus, the generated Web page image data, which usually is a bit map, 

image bit map generated by the browser application is not before it is transmitted on one of the downstream channels 

rendered at the headend, but instead is further compressed 65 to a user's set top box. 

by an MPEG image encoder. It is the compressed image data In general, however, MPEG encoding of Web page image 

that is transmitted to a user. data is needlessly intensive from a computation standpoint 
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since Web pages do not usually incorporate full motion 
video, and often appear to be nothing more than a still 
image. Strictly speaking, though, the Web page is not a still 
image. Due to the limited viewing size of a display device, 
the Web page is usually larger than the display device's 
viewing area. A user may therefore scroll a Web page to 
move the page horizontally or vertically to view the whole 
page. Depending on the speed at which the page is scrolling, 
the images on the display device may thus be considered to 
be a series of video frames displayed at a variable frame rate. 
Other Web pages may contain a small animation window in 
which several localized pictures are alternatively displayed 
at a certain rate. JAVA applets animation and regional 
character updates which occur as a user types an e-mail 
message are other examples of this local animation scenario. 
In both of these cases, MPEG inter frames may be con- 
structed after the generation of a first, intra fame, to reduce 
the number of bits needed to represent each frame, thus 
substantially reducing the required bandwidth in the com- 
munication link. 

As discussed previously, when an inter frame is 
generated, motion vectors must be found, coded and trans- 
mitted so that the MPEG decoder can reform the frame. A 
motion vector search is one of the most difficult tasks in 
designing an MPEG encoder. Since the MPEG committee 
defined only the syntax and semantics of a compressed 
frame, but did not define how motion vectors searching 
should be implemented, numerous proprietary motion vector 
search algorithms were developed by various encoder ven- 
dors. For continuous video compression, however, a motion 
vector search is very complicated and requires a large 
percentage of the entire encoding computational effort. 
More particularly, in MPEG encoding, each video frame to 
be encoded is subdivided into a plurality of multiple 64 
(8x8) pixel blocks, and four such blocks covering a 16x16 
pixel area are known as a macroblock. During encoding, the 
MPEG encoder searches for the best match between each 
macroblock of a present frame to be encoded with the 
corresponding macroblock in the previous frame. This 
search for the best match is known as motion estimation. 

The existing algorithms for motion estimation fall into 
two categories: feature/region matching and gradient-based. 
In the first category, both block matching and hierarchical 
block matching can be employed for motion estimation. For 
encoding a continuous video, the encoder has to search the 
entire screen (exhaustive search) to find the best match 
because the encoder knows nothing about the motion from 
frame to frame. In gradient -based motion estimation, the 
exhaustive search may be avoided at the price of solving 
linear equations during search. 

All of the algorithms require many iterations to complete 
the motion estimation. After the best match is found, the 
difference between the matched macroblocks is calculated 
by comparing the macroblocks. If the difference is small 
enough, a motion vector is generated which determines the 
direction and offset of the motion. Both the difference and 
the motion vector are encoded and transmitted. If the dif- 
ference is larger than a threshold, the macroblock of the 
present frame is allowed to be intra compressed as one 
encoded in an I frame. 

In view of the foregoing, any video image encoding 
technique that eliminates the need for motion vector search 
algorithms would be desirable in view of the resulting 
substantial savings in computation time and intensity. 

SUMMARY OF THE INVENTION 

The present invention provides an encoding technique for 
encoding low- frame rate video image data, such as Internet 
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Web pages, in which motion vectors are generated without 
search algorithms by taking advantage of prior knowledge 
regarding one or more characteristics of the images. In the 
preferred embodiments of the invention, the image charac- 

5 teristics are provided to an encoder, such as an MPEG 
encoder, from an image generating application, and relate to 
movement of or in the images. 

More particularly, both embodiments of the invention are 
designed specifically for use with CATV systems, as dis- 

10 cussed previously, which include Internet access capabili- 
ties. In these systems, when a user scrolls through a Web 
page, scrolling input signals are sent by the user's set top box 
to the browser application in the headend. These signals 
define the direction of the scrolling and its offset, typically 

15 in terms of x and y coordinates. In addition, the Web pages 
may contain one or more animation windows, the graphical 
content of which alternates or changes every second or so. 
The browser application can easily detect whether one or 
more animation windows is present in the Web page image, 

20 and if so, determine the coordinates of the animation 
window(s). The scrolling coordinate and animation window 
information can also be employed by the encoder to deter- 
mine the exact change between a previous image frame and 
a present image frame that has occurred as a result of the 

25 scrolling and/or animation window movement. With this 
knowledge, a motion vector search is unnecessary, and can 
be replaced with a set of calculations employing the scroll- 
ing coordinates. 
In the first preferred embodiment of the present invention, 

30 the encoder employs the scrolling coordinates to determine 
motion estimation for all of the macroblocks in the present 
frame relative to the previous frame in a single step, and 
without a multiple iteration search. A comparison between 
the macroblock of the present frame and the corresponding 

35 macroblock of the previous frame determined by the motion 
estimation, indicates whether the changed macroblock is the 
same as the corresponding macroblock in the previous frame 
which has been shifted in the direction and amount specified 
by the scrolling coordinates. If so, the motion vector for this 

40 macroblock of the frame has been located, and the motion 
vector and the difference between the macroblocks is 
encoded and transmitted. The process is repeated for each 
macroblock in the frame to generate the resulting inter 
frame. The resulting motion vector calculation and algo- 

45 rithm using the scrolling coordinates requires much less 
computation than a full search algorithm. 

In the second preferred embodiment, the encoder receives 
animation window or other information from the browser 
application that indicates that certain portions of an image 

50 are continuously changing, and thus should be encoded as an 
intra frame. If the browser application detects that one or 
more animation windows are present in the Web page image, 
it determines the coordinates of the animation window(s), 
and passes the coordinates to the encoder. The encoder 

55 knows that only the portions of the Web page enclosed by 
the animation window will undergo changes from frame to 
frame, absent any scrolling operations. Thus, if the encoder 
receives animation window coordinates from the browser 
application, the encoder knows that it can encode the present 

60 frame of the Web page by encoding only those macroblocks 
that are contained in the one or more animation windows. 
These are encoded either as intra macroblocks (no need for 
motion estimation) or as forward predictive coded macrob- 
locks by performing a motion estimation constrained within 

65 the animation window. The remaining macroblocks are 
encoded as zero motion vector blocks, which means that 
they have not changed from the previous frame. 
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BRIEF DESCRIPTION OF THE DRAWINGS from the distribution network 14, it is passed through a 

m r , , „ , . decoder 32 which restores the video programming signals 

The features and advantages of the present invention will and Web page ^ dala t0 their original form for display 

become apparent from the following detailed consideration on the television or monitor 26. The decoder 32 is of the 

of a number of preferred embodiments thereof, taken in 5 same format ^ tne encoder 22, such as MPEG 1 or MPEG 

conjunction with the accompanying drawings, in which: 2, for example. 

FIG. 1 is a block diagram of a CATV system which The CATV system 10 thus allows a system user to 

incorporates Internet access capabilities, and can employ a conduct an Internet session by sending appropriate com- 

digital encoding or compression algorithm based on the mands via the keyboard 28 and/or remote controller 30 to the 

principles of the present invention; 30 headend 12. In response, the headend 12 connects the user 

FIG. 2 is an illustration of an Internet Web page having an t0 one of me browser applications 20, and retrieves the 

animation window therein - requested Internet information from the remote source. The 

~. a . r i *u r i *• visual information generated by the browser application 20 

FIG. 3 is a flow chart of an algonthm for locating portions . , , , , & , , , J . , . , r 

c c j i *■ * ■ is encoded and downloaded to the user s set top box 16 for 

of an image frame which have moved relative to a previous ... >w 

c ° w c „. display on their television or monitor 26. 

image frame as a result of a scrolling operation, said 15 i, , , „, , 

algorithm comprising a first preferred embodiment of the To enoode me Web page image bit map data generated by 

present invention; and lhe browser application 20, the encoder 22 employs any 

. . . ■ , , . , , known conventional algorithm for encoding motion picture 

FIG. 4 is a flow chart of an algonthm comprising a second ^ fra ^ as MpEG J Qr MpEG 2 MotioQ ^ 

preferred embodiment of the present invention for encoding video enC£)ders enc£)de and ^ ^ vide£) daU fe 

video ullages having animation windows therein. comp aring each successive video frame with a previous 

DETAILED DESCRIPTION OF THE frame, and transmitting a predictive-coded or P frame which 

PREFERRED EMBODIMENTS includes information only about pixels in the video image 

that have changed from the previous frame. In motion 

FIG. 1 is a general block diagram of a CATV system 10 1$ pi ctur e video, where each succeeding frame usually differs 

which incorporates elements for facilitating access to the on j y sightly from a previous frame, this technique results in 

Internet by a plurality of system users, and is illustrative of a substantial reduction in the amount of data that has to be 

one type of system with which the concepts of the present transmitted with each frame. A P frame primarily consists of 

invention may be employed. It should be noted that the motion vectors and prediction errors associated with each 

CATV system 10 is illustrated in general form since many of 3Q mac roblock. The motion vector of a macroblock indicates 

its detailed elements are not necessary for an understanding tne di rec tion and magnitude of the motion of the macroblock 

of the present invention. from tDe previous frame to the present one. All motion 

The CATV system 10 includes a cable headend 12 and a vectors are coded using variable length coding (VLC). The 
cable television distribution network 14 for interfacing the prediction errors identify the difference between the mac- 
headend 12 to a plurality of set top converter boxes 16. A 35 roblock in the previous frame and the predictive one in the 
plurality of bi-directional transmission links 17 intercon- present frame shifted by the motion vector. The prediction 
nects the set top converter boxes 16 with the distribution errors are transformed into frequency domain by using a 
network 14, each of which includes a plurality of down- Discrete Cosine Transform whose coefficients then are 
stream channels 18 and one or more upstream channels 19, coded by VLC. This technique also results in a substantial 
For clarity, the details of only one of the set top boxes 16 and ^ reduction in the amount of data that needs to be transmitted 
associated elements are illustrated in FIG. 1. for each image, especially in situations where large portions 

The cable headend 12 receives video programming and of a total image are moving from frame to frame. 
Internet-based information from remote sources (not Since most existing motion vector search algorithms are 
shown), and transmits the video programming and other very iterative in nature, they are computationally intensive 
information through the distribution network 14 to the set 45 and time consuming to employ. The present invention elimi- 
top boxes 16. Typically, the video programming is received nates the need for employing motion vector search algo- 
from the remote source in either an analog format, or a rithms in specific instances which are particularly applicable 
digitally compressed or encoded format, such as MPEG 1 or to the encoding of the Web page image data where additional 
MPEG 2. The Internet-based information, on the other hand, information regarding any changes from frame to frame in 
is typically HTML coded Web pages along with still images 50 the image is available for use by the encoder 22. With this 
coded in JPEG or GIF formats which is employed by one or additional information, the encoder 22 is able to identify 
more browser applications 20 to generate Web page bit map portions of a Web page in a present frame, which have 
images. A video encoder 22, which may be an MPEG 1 or moved or changed from the previous frame, by using a set 
MPEG 2 encoder, for example, is also provided in the of calculations instead of a motion vector search algorithm, 
headend 12 for encoding the Web page bit map images 5s in a first preferred embodiment of the invention, a motion 
before they are transmitted by the distribution network 14 to vector to be employed in the formation of an MPEG P frame, 
the set top boxes 16. The encoder 22 is employed to reduce is generated in which the motion of a Web page or other 
the bandwidth necessary to transmit the data generated by image is a result of a scrolling operation performed by an 
the browser application 20, and to generate an MPEG end user through use of their keyboard 28 or remote con- 
standard compliant video stream. 60 trailer 30. FIG. 2 illustrates a Web page 50 as displayed in 

Each of the set top boxes 16 is interfaced via a terminal a browser application display image 52. The Web page 50 

processor 24 and associated communication links 25 (e.g., includes an animation window 53 that is a rectangular 

cables, infrared wireless links, etc.) to a television or moni- shaped area in which the grap hical content changes or 

tor 26, and one or more input devices, such as a wireless ^alternates periodically, such as once every second. As is 

keyboard 28 and a remote controller 30. As each set top box 65 conventional, tne browser application provides a number of 

16 receives the digitally (e.g., MPEG) encoded or com- scrolling buttons 54 along the right side of the display image 

pressed video programming and Internet-based information 52 to facilitate scrolling operations using the remote con- 
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troller 30 as a pointing device. In the example of the system 
10 of FIG. 1, when a user scrolls the displayed Web page 50 
using either the keyboard 28 or the remote controller 30, the 
scrolling commands are sent to the browser application 20 
which performs the actual scrolling operation. This trans- 
lates the Web page image, thus necessitating the formation 
of a number of a PJrames by the encoder 22 to transmit the 
necessary image change int'o rmation to the user's set tonho x 

16.*^ ■ : 

*^FTG. 3 is a flow chart of the steps employed in the first 
embodiment of the present invention to encode the Web 
page images when a scrolling operation occurs. First, at step 
100, when the browser application 20 detects a scrolling 
operation, it will notify the encoder 22 to form a P frame for 
the operation. In addition to the operation type, the browser 
application 20 will pass two scrolling parameters to the 
encoder 22, these being the scrolling direction and offset or 
distance. The direction of the scrolling is defined as follows: 

(1) Horizontally scrolling left: the content in a window 
contained in the Web page image moves right; 

(2) Horizontally scrolling right: the content in a window 
contained in the Web page image moves left; 

(3) Vertically scrolling up: the content in a window 
contained in the Web page image moves downward; 

(4) Vertically scrolling down: the content in a window 
contained in the Web page image moves upward; and 

(5) Horizontally and vertically scrolling simultaneously: 
the content in a window contained in the Web page 
image moves diagonally. 

As discussed previously, in an MPEG encoder, for 
example, two consecutive frames of a video image, F(n-1) 
and F(n), are employed for motion vector calculation. Sup- 
pose frame F(n) is a result of scrolling from frame F(n-l). 
In this case, the scrolling direction and scrolling offset or 
distance are determined by a pair of scrolling parameters, Xs 
and Ys, which are illustrated in Table 1: 

TABLE 1 





Xs <0 


Xs-0 


Xs > 0 


Ys < 0 


Scrolling diagonally; 


No horizontal 


Scrolling 




upward vertically by 


scrolling; 


diagonally; 




[Ys| pixels and left 


scrolling upward 


upward vertically 




horizontally by {Xs| 


vertically by |Ysj 


by |Ys| pixels and 




pixels 


pixels 


right horizontally 








by Xs pixels 


Ys = 0 


Scrolling left 


No horizontal 


Scrolling right 




horizontally by jXs| 


scrolling; no 


horizontally by 




pixels, no vertical 


vertical scrolling 


Xs pixels, no 




scrolling 




vertical 








scrolling 


Ys > 0 


Scrolling diagonally; 


No horizontal 


Scrolling 




downward vertically by 


scrolling; 


diagonally; 




Ys pixels and left 


scrolling 


downward 




horizontally by |Xs| 


downward 


vertically by Ys 




pixels 


vertically by Ys 


pixels and right 






pixels 


horizontally by 








Xs pixels 



15 



20 



25 



35 



Usually, a Web page scrolling does not require full screen 
scrolling because, (1) an overall fixed window frame exists 
all of the time which does not scroll at all; and (2) there are 
some Web pages containing multiple sub-windows. The 
contents in the sub-windows may be scrolled separately and 
independently. It is assumed that scrolling may move the 
contents only in one of the sub-windows. The display screen, 
therefore, is divided into a scrolling area and a non-scrolling 
area. 

With reference again to the flow chart of FIG. 3, the next 
steps are employed to identify scrolling and non -scrolling 



55 



60 
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areas of a Web page or other image by dividing an image 
frame into a plurality of multiple pixel macroblocks (MB). 
For example, each macroblock can be a square 16x16 pixels 
in size. 

Suppose that a frame, F(n), consists of M rows and N 
columns of macroblocks: 



F(n^{MBij(n)]f*Q, 1, . . . , AM; j-0,1, . . . , Af-l} 



30 



and 



MB itr {P 16i ^ 16J+ M*QA, ■ ... 15} 



(1) 



where p MfV (n) is a pixel at coordinate (u, v) in the nth frame. 

The motion vectors are calculated in the following man- 
ner. Each macroblock in the present frame is examined one 
at a time. For example, the process is started with the 
macroblock on the left top corner of the frame, i.e., i=0 and 
j«0. 

First, at step 102, the macroblock in the present (nth) 
frame is compared with the macroblock in the same location 
in the previous or (n-l)th frame to determine if the differ- 
ence in pixel intensities between the two macroblocks is 
smaller than a predetermined threshold Th as determined by 
equation 2: 



WMBijW-MBiJn-DW-- 



(2) 



30 



If the inequality in equation 2 is satisfied, the macroblock 
MB;j is considered to be in the non-scrolling area of the 
image. In other words, the macroblock has not changed from 
the (n-l)th frame to the nth frame. Thus, the motion vector 
associated with this macroblock is set to zero at step 103. 
That is, 



40 



A*V,>M0,0) 



(3) 



If, on the other hand, the inequality in equation 2 is not 
satisfied, then the algorithm proceeds to step 104 in which 
the macroblock in the nth frame is compared with the same 
45 macroblock shifted by either Xs or Ys in the (n-l)th frame 
by using the following equation: 

\\MB itJ in) - MB i+X /i6j*r 4 n6(n - 1 )|| = (4) 
is 

50 XI UPMi*">i*J+*l n )~ Pl6i+Xs,n.l6j+Y s +v(n - 1)1) 2 < Th 



If the inequality in equation 4 is satisfied, the macroblock 
M ti /n) is said to be in the scrolling area, and can be 
reconstructed at step 106 by shifting macroblock MB l>A - j/16> 
/+y /16 (n-l) by -Xs and -Ys pixels. Notice that the shifting 
oflsets, X s and Y^ may not necessarily be multiple of 
integer 16, therefore, the macroblock may not align with 
macroblock boundary. It follows from the definition of a 
motion vector that the compensation motion vector for this 
macroblock MBjn) is: 



65 



(5) 



If both of the inequalities in equations 2 and 4 are not 
satisfied, the macroblock MB.-^n) cannot be identified using 
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a motion vector, and must be encoded at step 108 as an ceeds to step 206 to adjust the macroblock boundary align - 

intra-macroblock. ment so that each animation window is expanded to its 

Once either a motion vector is generated for the macrob- nearest macroblock boundaries. The following equations are 

lock using equations 3 or 5, or the macroblock is encoded as used for the macroblock boundary alignment adjustment: 

an intra-macroblock, the algorithm proceeds to step 110 in 5 

which the indices, i and j, are incremented, and are checked = fi. « ie t8) 

to see whether the last macroblock in the frame has been 16 

encoded. If not, the program returns to step 102 to encode y t = Zi *ie 

the next macroblock in the frame. If the last macroblock has 16 

already been encoded, the encoding of the P frame is 10 y = Hi m ^ + 15 

complete, and the procedure is repeated for the next frame. 16 

The foregoing motion vector calculation algorithm ^ 

requires much less computation than a full screen search y 2 

algorithm. Furthermore, the best case will be a found match y *-~ \Z * 16 + 15 

between two macroblocks in the first step 102 of the is 
procedure. In this case, the search for the motion vector is 

done in one step for the macroblock. It is assumed that the ^ four equalities of equation 8 are integer operations 

neighboring macroblocks will fall in the same kind of area which § enerate a new L ^ °/ coordinates that define a new 

(scrolling or non-scrolling). If a macroblock is found in a animation window whose frame boundary aligns with the 

scrolling area, then for the next macroblock on the same 20 macroblock boundaries, and is defined as: 
slice, step 102 may be switched with step 104 in the process 

to check whether the macroblock is in the scrolling area first, ' y^ and (* 2 ', y 2 t )\a*x l '<x 2 , <s jn 0 ^'-ey/^} (?) 
since it is likely that this is the case In other words the Since ^ of the macroblocks ^ an ^ frame m 
conditional probability that a macroblock MB i+1 » will fall mutually exclusive (no overlapping), the animation window 
in a scrolling area, given that the adjacent macroblock, 25 ^ equation ( 9) can be re defined as: 
MB^n) is in the scrolling area, is much higher than the 
likelihood that the macroblock MB t+1)/ (n) is in a non- 
scrolling area as defined by the following equation: ^/-{iffi^fc'sie/a,', y^iQ^*)} (10) 

Equation 10 defines a window that consists of all of the 
30 macroblocks enclosed by the window. This equation is 

/>{Aftr t>1 >)eS|Art^ ( 6 ) therefore used at step 208 to verify whether a particular 

where S is a scrolling area. Similarly, if a macroblock is ^block belongs to the animation window or not. 

found in a non-scrolling area, no switch between step 102 ° Dce * e macroblocks defining the one or more animation 

and step 104 is necessary for checking the next macroblock have bc ? n ^nmned at step 208 using the fore- 

for the same reason. Therefore, most of the macroblocks 35 go^g equation the next step 210 is to encode the Web page 

need only one step to find their motion vectors. Of course, data *? % P * ame * F °J" a glVe ° ™*»> wmdow ° f 

the worst case scenario will be when no matches are found a Web PJ*^ f° llowin S ^ubsteps of step 210 are employed 

in either step 102 or step 104. However, this should rarely £ encod ^ * e P fram K e ; A < ste P 21 f 2 ' the a gonthm starts with 

happen in a normal scrolling operation, with the exception the ?°P le * P?™" 0 * m f ^me (i=0 and j-0) and 

c .L u 1- .u if- j . . 40 continues the following algorithm for all of the macroblocks 

of the case where both scrolling and animation occur simul- . . Z /\" r j7 ■ 

. , in the frame. If a macroblock MB lV (n) is found by equation 

Turning now to a second preferred embodiment of the 10 . t0 te an e !^f nt ' m animation window then two 

invention, FIG. 4 is a flow chart illustrating the steps carried °P tl0as are avallable t° encode the macroblock. If encoding 

out by an algorithm which detects an animation window in ,lme * a ma J or ."^derat.on, " will be simply encoded at 

a Web page, and encodes the Web page image data in 45 ste P 2 " aS I( a °."? tra macrobl °?* at the ? nce of mo f e b >J» 

accordance with the animation window coordinates. Return- f nerated ' K compression ratio is more preferable 

ing briefly to FIG. 2, the animation window S3 can be due «o .communication bandwidth hmitat.on any motion 

defined by x and y coordinates. Let Sx and Sy be the total f unatlon al S onthm ma y be e "P lo y ed at ste P 215 to search 

number of pixels in the x and y directions, respectively, of for moUon vector * Wllhul lhe wunwuoii ^ This 

the entire browser application display image 52. The ani- 50 P mcess 18 not " lntenslve as P**™«« a m ° b °° 

mation window 53 can then be defined as: vector ° u f the ™ image since the animation 

window is much smaller than a full screen. If a macroblock 
MB iV (n) is not located within the animation window, it is 

yi) and (** yJ\0£xi<xi<Sx> o%i <y 2 <^y} CO encoded at step 216 as a forward compensated macroblock, 

55 with a zero motion vector. Basically, this macroblock can be 

With reference to FIG. 4, a first step 200 in this embodi- skipped if it is not at the very beginning or the end of a slice, 

ment of the invention is for the browser application 20 to In other words, this macroblock will not have changed from 

detect whether one or more animation windows is/are the previously encoded frame, and thus no new information 

present in the currently displayed Web page. If so, the need be encoded for this macroblock. Thus, only those 

browser application 20 passes, at step 202, the x, y coordi- 60 macroblocks which are included in any detected animation 

nates of the one or more animation windows to the encoder windows that are present in the Web page will need to be 

22. encoded by the encoder 22, and transmitted to the set top 

Next, at step 204, the encoder 22 determines whether the converter box's decoder 32. Finally, at step 218* the next 

animation window coordinates align with the macroblock macroblock, if any, in the frame is selected, and encoded by 

boundaries employed by the encoder 22. In the vast majority 65 returning to step 212. 

of cases, the animation window(s) will not line up with the In a case where both scrolling and animation occur 

macroblock boundaries. In this case, the encoder 22 pro- simultaneously when a user scrolls a Web page with ani- 
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mation window(s), the browser application must pass a 
scrolling operation type to the encoder. The inequalities of 
both equations (2) and (4) in the first preferred embodiment 
of the invention will not be satisfied for the macroblocks in 
an animation window during scrolling. Those macroblocks 
will be automatically encoded as intra macroblocks. 

In conclusion, both embodiments of the present invention 
provide simplified video encoding or compression for 
encoding image data, particularly Web page images. In both 
embodiments of the present invention, knowledge of prior 
operation conditions completely eliminates the need to per- 
form motion vector searches during encoding of a P frame. 
Instead, the prior knowledge enables the motion vectors to 
either be calculated by given equations, or set to zeros. This 
efficient technique makes it feasible to create an MPEG 
encoder, for example, in _a real time software program . 

Although the invention^has oeen disclosed in terms of a 
number of preferred embodiments, it will be understood that 
numerous variations and modifications could be made 
thereto without departing from the scope of the invention as 
defined in the following claims. For example, although the 
preferred embodiments are directed specifically to encoding 
of Internet Web pages, the invention can obviously be 
applied to any video or image encoding application where 
prior knowledge of the changes or movement in the video or 
images is available so that motion vector searches need not 
be utilized to generate P and B frames. In addition, the 
invention is not limited to use with MPEG encoders for 
generating P frames, and can be employed with other video 
encoding schemes and/or encoded frame types. 

What is claimed is: 

1. A method for encoding multiple frame image data 
comprising: 

a) supplying a sequence of image frames to an encoder; 

b) providing information to said encoder identifying a 
characteristic of said sequence of image frames; and 

c) encoding at least a portion of said at least one frame in 
said sequence without employing a motion vector 
search algorithm by employing said characteristic iden- 
tifying information and image information relating to at 
least one previous image frame in said sequence. 

2. The method of claim 1, wherein said sequence of image 
frames is generated by an Internet browser application, and 
comprises sequential images of one or more Internet Web 
pages. 

3. The method of claim 1, wherein said characteristic 
identifying information comprises scrolling coordinate 
information. 

4. The method of claim 3, wherein said step of encoding 
further comprises: 

1) calculating at least one motion vector for a frame to be 
encoded using said scrolling coordinate information; 
and 

2) encoding said frame to be encoded as an inter frame 
using said at least one motion vector and said image 
information for a previous frame. 

5. The method of claim 4, wherein said characteristic 
information further includes location information identify- 
ing the location of at least one animation window in an 
image, and said step of encoding further comprises encoding 
a portion of said frame containing said animation window as 
an intra frame. 

6. The method of claim 3, wherein each of said frames in 
said sequence is comprised of a plurality of multiple pixel 
macroblocks, and said step of encoding further comprises: 

1) comparing pixel intensities in a first macroblock of a 
frame to be encoded with pixel intensities of a corre- 
sponding macroblock at a same location in a previous 
frame; 
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2) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
macroblock is below a predetermined threshold, then 
setting a motion vector to zero for said macroblock to 

5 be encoded, and, if any uncoded macroblocks remain in 
said frame to be encoded, returning to step 1 for a next 
macroblock in said frame; 

3) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 

!0 macroblock is above a predetermined threshold, then 
comparing said macroblock to be encoded with a 
corresponding shifted macroblock in a previous frame 
that is shifted in direction and offset in accordance with 
said scrolling coordinate information; 
15 4) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
shifted macroblock is below a predetermined threshold, 
then employing said scrolling coordinate information 
to calculate a motion vector for said macroblock to be 
20 encoded, and, if any uncoded macroblocks remain in 
said frame to be encoded, returning to step 1 for a next 
macroblock in said frame; and 
5) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
25 shifted macroblock is above a predetermined threshold, 
then encoding said macroblock to be encoded as an 
intra-macroblock, and, if any uncoded macroblocks 
remain in said frame to be encoded, returning to step 1 
for a next macroblock in said frame. 
30 7. The method of claim 6, wherein steps 3 and 4 are 
performed before steps 1 and 2 for a next macroblock to be 
encoded if the difference in pixel intensities between a 
present macroblock to be encoded and said corresponding 
shifted macroblock is below a predetermined threshold, and 
35 the next macroblock to be encoded is adjacent said present 
macroblock to be encoded. 

8. The method of claim 1, wherein said characteristic 
identifying information, comprises location information 
identifying the location of at least one animation window in 

40 an image, and said step of encoding further comprises 
encoding said frame to be encoded using said location 
information. 

9. The method of claim 8, wherein each of said frames in 
said sequence is comprised of a plurality of multiple pixel 

45 macroblocks, and said step of encoding further comprises: 

1) employing said location information to identify one or 
more macroblocks in a frame to be encoded which 
include at least a portion that forms part of said at least 

5Q one animation window; 

2) encoding said one or more macroblocks as intra- 
macroblocks; and 

3) encoding any remaining macroblocks in said frame to 
be encoded as forward predictive coded macroblocks, 

55 with a zero motion vector. 

10. The method of claim 8, wherein each of said frames 
in said sequence is comprised of a plurality of multiple pixel 
macroblocks, and said step of encoding further comprises: 

1) employing said location information to identify one or . 
60 more macroblocks in a frame to be encoded which 

include at least a portion that forms part of said at least 
one animation window; 

2) encoding said one or more macroblocks by employing 
a motion vector search algorithm; and 

65 3) encoding any remaining macroblocks in said frame to 
be encoded as forward predictive coded macroblocks, 
with a zero motion vector. 
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11. A system for encoding multiple frame image data 
comprising: 

a) an image generating application for generating a 
sequence of image frames to be encoded, said appli- 
cation including information identifying a characteris- 5 
tic of said sequence of image frames; and 

b) an encoder for receiving said sequence of image frames 
from said application and encoding said frames, said 
encoder obtaining said information from said applica- 
tion identifying a characteristic of said sequence of 30 
image frames, and encoding at least a portion of at least 
one frame in said sequence without employing a 
motion vector search algorithm by employing said 
characteristic identifying information and image infor- 
mation relating to at least one previous image frame in 15 
said sequence. 

12. The system of claim 4, wherein said image generating 
application comprises an Internet browser application, and 
said sequence of image frames comprise sequential images 

of one or more Internet Web pages. 20 

13. The system of claim 12, wherein said browser appli- 
cation and said encoder are disposed in a cable television 
system headend, and a plurality of downstream communi- 
cation channels is provided for transmitting encoded Internet 
Web page data from said encoder to one or more remotely 25 
located set top converter boxes. 

14. The system of claim 11, wherein said characteristic 
identifying information comprises scrolling coordinate 
information which is generated by said image generating 
application, and said encoder includes programming for 30 
generating motion vectors using said scrolling coordinate 
information. 

15. The system of claim 14, wherein said characteristic 
identifying information further comprises location informa- 
tion identifying the location of one or more animation 35 
windows in an image frame. 

16. The system of claim 14, wherein each of said frames 
in said sequence is comprised of a plurality of multiple pixel 
macroblocks, and said encoder further includes program- 
ming for carrying out the following steps: 40 

1) comparing pixel intensities in a first macroblock of a 
frame to be encoded with pixel intensities of a corre- 
sponding macroblock at a same location in a previous 
frame; 45 

2) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
macroblock is below a predetermined threshold, then 
setting a motion vector to zero for said macroblock to 
be encoded, and, if any uncoded macroblocks remain in 
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said frame to be encoded, returning to step 1 for a next 
macroblock in said frame; 

3) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
macroblock is above a predetermined threshold, then 
comparing said macroblock to be encoded with a 
corresponding shifted macroblock in a previous frame 
that is shifted in direction and offset in accordance with 
said scrolling coordinate information; 

4) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
shifted macroblock is below a predetermined threshold, 
then employing said scrolling coordinate information 
to calculate a motion vector for said macroblock to be 
encoded, and, if any uncoded macroblocks remain in 
said frame to be encoded, returning to step 1 for a next 
macroblock in said frame; and 

5) if the difference in pixel intensities between said 
macroblock to be encoded and said corresponding 
shifted macroblock is above a predetermined threshold, 
then- encoding said macroblock to be encoded as an 
intra-macroblock, and, if any uncoded macroblocks 
remain in said frame to be encoded, returning to step 1 
for a next macroblock in said frame. 

17. The system of claim 16, wherein said encoder further 
includes programming for performing steps 3 and 4 before 
steps 1 and 2 for a next macroblock to be encoded if the 
difference in pixel intensities between a present macroblock 
to be encoded and said corresponding shifted macroblock is 
below a predetermined threshold, and the next macroblock 
to be encoded is adjacent said present macroblock to be 
encoded. 

18. The system of claim 11, wherein said characteristic 
identifying information comprises location information 
identifying the location of at least one animation window in 
an image, wherein each of said frames in said sequence is 
comprised of a plurality of multiple pixel macroblocks, and 
wherein said encoder further includes means for: 

1) employing said location information to identify one or 
more macroblocks in a frame to be encoded which 
include at least a portion that forms part of said at least 
one animation window; 

2) encoding said one or more macroblocks as intra- 
macroblocks; and 

3) encoding any remaining macroblocks in said frame to 
be encoded as forward predictive coded macroblocks, 
with a zero motion vector. 
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